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Abstract 

Classical problems of sorting and searching assume an underlying linear ordering of the ob- 
jects being compared. In this paper, we study a more general setting, in which some pairs of 
objects are incomparable. This generalization is relevant in applications related to rankings in 
sports, college admissions, or conference submissions. It also has potential applications in biol- 
ogy, such as comparing the evolutionary fitness of different strains of bacteria, or understanding 
input-output relations among a set of metabolic reactions or the causal influences among a set 
of interacting genes or proteins. Our results improve and extend results from two decades ago 
^ ■ of Faigle and Turan, who were the first to consider some of the problems considered here. 

A poset is defined as a set of elements with a transitive partial order where some pairs 
of elements may be incomparable. A measure of complexity of a poset is given by its width, 
which is the maximum size of a set of mutually incomparable elements. We consider algorithms 
that obtain information about a poset by queries that compare two elements. We consider two 
I complexity measures: query complexity, which counts only the number of queries, and total 

l/^ ■ complexity, which counts all operations. 

We present an algorithm that sorts a width w poset of size n and has query complexity 

■ 0{n{w + logn)), which is within a constant factor of the information-theoretic lower bound. We 
I also show that a variant of Mergesort has query complexity 0{wnlog ^) and total complexity 

■ 0{w^nlog Faigle and Turan have shown that the sorting problem has query complexity 
O (wnlog ^) but did not address the total compexity of the problem. 

Two problems related to sorting are the problem of finding the minimal elements in a poset 
KJi I and its generalization of finding the bottom k "levels" , called the k-selection problem. We give 

. efficient deterministic and randomized algorithms for finding the minimal elements with 0{wn) 

d ' query and total complexity. We provide matching lower bounds for the query complexity up to 

a factor of 2 and generalize the results to the A:-selcction problem. We also derive upper bounds 
on the total complexity of some other problems of a similar flavor, such as computing a linear 
extension of a poset and computing the heights of all elements. 

Many open problems remain, of which the most significant is to determine the precise total 
complexity of sorting, as well as the precise query and total complexity of fc-selection. It would 
also be interesting to find efficient static and dynamic data structures that play the same role 
for partial orders that heaps and binary search trees play for total orders. 
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1 Introduction 



Sorting is the process of determining the underlying linear ordering of a set S of n elements. 
Comparison algorithms, in which direct comparisons between pairs of elements of S are the only 
means of acquiring information about the linear ordering, form an important subclass, including 
such familiar algorithms as Heapsort, Quicksort, Mergesort, Shellsort and Bubblesort. 

In this paper we extend the theory of comparison sorting to the case where the underlying 
structure of the set S is a partial order, in which an element may be larger than, smaller than, or 
incomparable to another element, and the "larger-than" relation is transitive and irreflexive. Such a 
set is called a partially ordered set, or poset. This extension is applicable to many ranking problems 
where certain pairs of elements are incomparable. Examples include ranking college applicants, 
conference submissions, tennis players, strains of bacteria according to their evolutionary fitness, 
and points in under the coordinate-wise dominance relation. 

Our algorithms gather information by queries to an oracle. The oracle's response to a query 
involving elements x and y is either the relation between x and y or a statement of their incom- 
parability. In many applications, a query may involve extensive effort (for example, running an 
experiment to determine the relative evolutionary fitness of two strains of bacteria, or comparing 
the credentials of two candidates for nomination to a learned society). We therefore consider two 
measures of complexity for an algorithm or problem: the query complexity, which is the number of 
queries performed, and the total complexity, which is the number of computational operations of 
all types performed (basic operations include standard data structure operations involving one or 
two elements of the poset). 

A partial order on a set can be thought of as the reachability relation of a directed acyclic graph 
(DAG). More generally, a transitive relation (which is not necessarily irreflexive) can be thought 
of as the reachability relation of a general directed graph. In applications, the relation represents 
the direct and indirect causal influences among a set of variables, processes, or components of a 
system. We show that with negligible overhead, the problem of sorting a transitive relation reduces 
to the problem of sorting a partial order. Our algorithms thus allow one to reconstruct general 
directed graphs, given an oracle for queries on reachability from one node to another. As directed 
graphs are the basic model for many real-life networks including social, information, biological 
and technological networks (see [14] for a survey), our algorithms provide a potential tool for the 
reconstruction of such networks. 

There is a vast literature on algorithms for determining properties of an initially unknown total 
order by means of comparisons. Partial orders often arise in these studies as a representation of 
the "information state" at a general step of such an algorithm. In such cases the incomparability 
of two elements simply means that their true relation has not been determined yet. The present 
work is quite different, in that the underlying structure to be discovered is a partial order, and 
incomparability of elements is inherent, rather than representing temporary lack of information. 
Nevertheless, the body of work on comparison algorithms for total orders provides valuable tools 
and insights that can be extended to the present context (e.g. [H El [U [T2| [T3]). 

The model considered here was previously considered by Faigle and Turan [5], who presented 
two algorithms for the problem of sorting a partial ordered set, which they term "identification" 
of a poset. We formally describe their results in Section II. 2i A recent paper [15] considers an 
extension of the searching and sorting problem to partial orders that are either trees or forests. 



1 



1.1 Definitions 

To precisely describe the problems considered in this paper and our results, we require some formal 
definitions. A partially ordered set, or poset, is a pair V = {P,>-), where P is a set of elements 
and ;^ C P X P is an irreflexive, transitive binary relation. For elements a,b £ P, if (a, b) € y, we 
write a y b and we say that a dominates b, or that b is smaller than a. li a )/- b and b )/- a, we say 
that a and b are incomparable and write a ^ b. 

A chain C Q P is a subset of mutually comparable elements, that is, a subset such that for any 
elements Ci,Cj (z C, i j, either Cj >~ cj or cj >- Ci. An ideal / C P is a subset of elements such 
that if x G / and x y y, then y £ I. The height of an element a is the maximum cardinality of a 
chain whose elements are all dominated by a. We call the set {a : V6, 6 ;^ a or 6 7^ a} of elements 
of height the minimal elements. An anti-chain ^ C P is a subset of mutually incomparable 
elements. The width w{V) of poset V is defined to be the maximum cardinality of an anti-chain of 
V. 

A decomposition C oiV into chains is a family C = {Ci, C2, . . . , Cq} of disjoint chains such that 
their union is P. The size of a decomposition is the number of chains in it. The width w{V) is 
clearly a lower bound on the size of any decomposition of V. We make frequent use of Dilworth's 
Theorem, which states that there is a decomposition of V of size w{V). A decomposition of size 
w{V) is called a minimum chain decomposition. 

1.2 Sorting and /c-selection 

The central computational problems of this paper are sorting and k-selection. The sorting problem 
is to completely determine the partial order on a set of n elements, and the /c-selection problem is 
to determine the set of elements of height at most k — 1, i.e., the set of elements in the k bottom 
levels of the partial order. In both problems we are given an upper bound of w on the width of the 
partial order. 

In the absence of a bound on the width, the query complexity of the sorting problem is exactly 
(2) , in view of the worst-case example in which all pairs of elements are incomparable. In the 
classical sorting and selection problems, w = 1. Our interest is mainly in the case where w <^ n, 
since this assumption is natural in many of the applications. Furthermore, if w is of the same order 
as n, then it is easy to see that the complexity of sorting is of order n^, as in the case where no 
restrictions are imposed on the poset. 

Faigle and Turan [5] have described two algorithms for sorting posets, both of which have 
query complexity O (wnlog (In fact the second algorithm is shown to have query complexity 
0{n log Np), where Np is the number of ideals in input poset V. It is easy to see that Np = 0{n^) 
if V has width w, and that N-p = (n/w)^ if V consists of w incomparable chains, each of size n/w.) 
The total complexity of sorting posets has not been considered. However, the total complexity of 
the first algorithm given by Faigle and Turan depends on the subroutine for computing a chain 
decomposition (the complexity of which is not analyzed in [5]). It is not clear if there exists a 
polynomial-time implementation of the second algorithm. 

1.3 Techniques 

It is natural to approach the problems of sorting and /c-selection in posets by considering general- 
izations of the well-known algorithms for the case of total orders, whose running times are closely 
matched by proven lower bounds. Somewhat surprisingly, natural generalizations of the classic 
algorithms do not provide optimal poset algorithms in terms of total and query complexity. 
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In the case of sorting, the generahzation of Mergesort considered here loses a factor of w in 
its total complexity compared to the information-theoretic lower bound. Interestingly, one can 
achieve the information-theoretic lower bound on query complexity (up to a constant factor) by 
carefully exploiting the structure of the poset. We do not know whether it is possible to achieve 
the information-theoretic bound on total complexity. 

The seemingly easier problem of /c-selection still poses some challenges. In particular, nontrivial 
arguments are needed to obtain both lower and upper bounds. Moreover, there is a gap of factor 
2 between the lower and upper bound, even for the problem of finding minimal elements. 

1.4 Main Results and Paper Outline 

In Section [21 we briefly discuss an efficient representation of a poset. The representation is of size 
0{ujn), and it allows the relation between any two elements to be retrieved in time 0(1). 
In Sections 13.11 and 13.21 we prove the following main theorems: 

Theorem 1. There exists an algorithm for sorting a poset of width at most w over n elements with 
optimal query complexity 0{n(logn + w)). 

Theorem 2. A generalization of Mergesort for sorting a poset of width at most w over n elements 
has query complexity 0{vunlogn) and total complexity 0{w'^nlogn). The algorithm also provides 
a minimum chain decomposition of the set. 

In Section m we consider the fc-selection problem of determining the elements of height less than 
or equal to k — 1. We give upper and lower bounds on the query complexity and total complexity of 
/c-selection within deterministic and randomized models of computation. For the case k = 1 (finding 
the minimal elements), we show that the query complexity and total complexity are Q{wn). The 
query upper bounds match the query lower bounds up to a factor of 2. 

In Section[5l we give a randomized algorithm, based on a generalization of Quicksort, of expected 
total complexity 0{n{logn + w)) for computing a linear extension of a poset. We also give a 
randomized algorithm of expected total complexity 0{wnlogn) for computing the heights of all 
elements in a poset. 

Finally, in Section [Gj we show that the results on sorting posets generalize to the case when an 
upper bound on the width is not known and to the case of transitive relations. 

1.5 Acknowledgments: 

E.M. would like to thank Mike Saks for the reference to the work of Faigle and Turan [5]. 

2 Representing a poset: the ChainMerge data structure 

Once the relation between every pair of elements in a poset has been determined, some represen- 
tation of this information is required, both for output and for use in our algorithms. The simple 
ChainMerge data structure that we describe here supports constant-time look-ups of the relation 
between any pair of elements. It is built from a chain decomposition of the poset. 

Let C = {Ci,...Cg} be a chain decomposition of a poset V = {P,>-)- ChainMerge(7', C) 
stores, for each element x P, q indices as follows: Let Cj be the chain of C containing x. The 
data structure stores the index of x in Cj and, for all j, 1 < j < q, j i, the index of the largest 
element of chain Cj that is dominated by x. The performance of the data structure is characterized 
by the following lemma. 
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Claim 3. Given a query oracle for a poset V = (P, >-) and a decomposition C of V into q chains, 
building the ChainMerge data structure has query complexity at most 2qn and total complexity 
0{qn), where n = \P\. Given ChainMerge('P, C), the relation in V of any pair of elements can 
he found in constant time. 

Proof. The indices corresponding to chain Cj that must be stored for the elements in chain Cj can 
be found in 0(|Cj| + \Cj\) time, using |Cj| + \Cj\ queries, by simultaneously scanning Cj and Cj. 
Since each chain is scanned 2g — 1 times, it follows that the query complexity of ChainMerge('P, C) 
is at most 2qn, and the total complexity is 0{q ■ Yl\=i — 0{qn). 

Let x,y £ P, with x £ Ci and y € Cj. The look-up operation works as follows: If i = j, we 
simply do a comparison on the indices of x and y in Cj, as in the case of a total order. If i 7^ j, 
then we look up the index of the largest element of Cj that is dominated by x; this index is greater 
than (or equal to) the index of y in Cj if and only if x y y. If x ^ y, then we look up the index of 
the largest element of Cj that is dominated y; this index is greater than (or equal to) the index of 
X in Cj if and only if y y x. If neither x )~ y nor y y x, then x ^ y. □ 

3 The sorting problem 

We address the problem of sorting a poset, which is the computational task of producing a repre- 
sentation of a poset V = {P, y), given the set P of n elements, an upper bound of w on the width 
of V, and access to an oracle for V. (See Section 16.11 for a discussion of the case when an upper 
bound on the width is not known.) An information-theoretic lower bound on the query complexity 
of sorting is implied by the following theorem of Brightwell and Goodall [2] , which provides a lower 
bound on the number A^^(n) of posets of width at most w on n elements. 

Theorem 4. The number Nw{n) of partially ordered sets of n elements and width at most w 
satisfies 



3.1 An optimal sorting algorithm 

In this section, we describe a sorting algorithm that has optimal query complexity, i.e. it sorts a 
poset of width at most w on n elements using 0(nlogn -|- wn) oracle queries. Our algorithm is 
not necessarily computationally efficient, so in Section 13. 2| we consider efficient solutions to the 
problem. 

Before presenting our algorithm, it is worth discussing an intuitive approach that is different 
from the one we take. For any set of oracle queries and responses, there is a corresponding set of 
posets, which we call candidates, that are the posets consistent with the responses to these queries. 
A natural sorting algorithm is to find a sequence of oracle queries such that, for each query (or for 
a positive fraction of the queries), the possible responses to it partition the space of posets that 
are candidates after the previous queries into three parts, at least two of which are relatively large. 
Such an algorithm would achieve the information-theoretic lower bound (up to a constant). 

For example, the effectiveness of Quicksort for sorting total orders relies on the fact that most 
of the queries made by the algorithm partition the space of candidate total orders into two parts, 
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It follows that, for w = o 




log N^j (n) = Q{n log n + wn). 



4 



Algorithm Poset-BinInsertionSort(P) 

input: a set P, a query oracle for a poset V = {P, y), and upper bound of w on width of V 
output: a ChainMerge data structure for V 

1. V := ({e}, {}), where e € P is some arbitrary element; /* V' is the current poset*/ 

2. P':={e};n':={}; 

3. U := P\ {e}; /* U is the set of elements that have not been inserted */ 

4. while [/ =^ 

a. pick an arbitrary element e € [/; /* e is the element that will be inserted in P'*/ 

b. U:=U\{e}; 

c. find a chain decomposition C = {Ci, C2, . . . , Cq} of P', with q < w chains; 

d. for i = 1, . . . ,q 

i. let Ci = {en,. . . , eu^}, where eu^ y . . . y ei2 y en; 

ii. do binary search on Ci to find smallest element (if any) that dominates e; 
iii. do binary search on Ci to find largest element (if any) that is dominated by e; 

e. based on results of binary searches, infer all relations of e with elements of P'; 

f. add into TZ' all the relations of e with the elements oi P' ; P' := P' U {e}; 

g. v' = {p',n'y, 

5. find a decomposition C of V'; build ChainMerge('P', C) (no additional queries needed); 

6. return ChainMerge(7'', C); 

Figure 1: pseudo-code for Poset-BinInsertionSort 



each of relative size of at least 1/4. Indeed, in the case of total orders, much more is known: for 
any subset of queries, there is a query that partitions the space of candidate total orders, i.e. linear 
extensions, into two parts, each of relative size of at least 3/11 [lOj . 

In the case of width-u; posets, however, it could potentially be the case that most queries 
partition the space into three parts, one of which is much larger than the other two. For example, 
if the set consists of w incomparable chains, each of size n/w, then a random query has a response 
of incomparability with probability about 1 — 1/w. (On an intuitive level, this explains the extra 
factor of w in the query complexity of our version of Mergesort, given in Section [3.2[ ) Hence, we 
resort to more elaborate sorting strategies. 

Our optimal algorithm builds upon a basic algorithm that we call Poset-BinInsertionSort, 
which is identical to "Algorithm A" of Faigle and Turan [5] . The algorithm is inspired by the binary 
insertion-sort algorithm for total orders. Pseudocode for Poset— BinInsertionSort is presented 
in Figure [H The natural idea behind Poset-BinInsertionSort is to sequentially insert elements 
into a subset of the poset, while maintaining a chain decomposition of the latter into a number 
of chains equal to the width w of the poset to be constructed. A straightforward implementation 
of this idea is to perform a binary search on every chain of the decomposition in order to figure 
out the relationship of the element being inserted with every element of that chain and, ultimately, 
with all the elements of the current poset. It turns out that this simple algorithm is not optimal; 
it is off by a factor of w from the optimum. In the rest of this section, we show how to adapt 
Poset-BinInsertionSort to achieve the information-theoretic lower bound. 

We begin by analyzing Poset-BinInsertionSort. 

Lemma 5 (Faigle & Turan [5]). Poset-BinInsertionSort sorts any partial order V of width at 
most w on n elements using at most 0{wnlogn) oracle queries. 
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Proof. The correctness of Poset-BinInsertionSort should be clear from its description. (The 
simple argument showing that Step 4e can be executed based on the information obtained in Step 
4d is similar to the proof for the ChainMerge data structure in Section [2l) It is not hard to 
see that the number of oracle queries incurred by Poset— BinInsertionSort for inserting each 
element is 0{wlogn) and, therefore, the total number of queries is 0{wnlogn). □ 

It follows that, as n scales, the number of queries incurred by the algorithm is more by a factor 
of w than the lower bound. The Achilles' heel of the Poset-BinInsertionSort algorithm is in 
the method of insertion of an element — specifically, in the way the binary searches of Step 4d are 
performed. In these sequences of queries, no structural properties of V are used for deciding which 
queries to the oracle are more useful than others; in some sense, the binary searches give the same 
"attention" to queries whose answer would greatly decrease the number of remaining possibilities 
and those whose answer is not very informative. However, as we discuss earlier in this section, a 
sorting algorithm that always makes the most informative query is not guaranteed to be optimal. 

Our algorithm tries to resolve this dilemma. We suggest a scheme that has the same structure 
as the Poset-BinInsertionSort algorithm but exploits the structure of the already constructed 
poset V' in order to amortize the cost of the queries over the insertions. The amortized query cost 
matches the information-theoretic bound. 

The new algorithm, named EntropySort, modifies the binary searches of Step 4d into weighted 
binary searches. The weights assigned to the elements satisfy the following property: the number of 
queries it takes to insert an element into a chain is proportional to the number of candidate posets 
that will be eliminated after the insertion of the element. In other words, we spend fewer queries 
for insertions that are not informative and more queries for insertions that are informative. In some 
sense, this corresponds to an entropy-weighted binary search. To define this notion precisely, we 
use the following definition. 

Definition 1. Suppose that V' = {P',TZ') is a poset of width at most w, U a set of elements such 
thatUnP' = $,ueU and£n,Vn C {{u}xP')u{P' x{u}). We say thatV = {P'UU,TZ) zs a width 
w extension of V' on U conditioned on {£TZ,VTZ), if V is a poset of width w, TZCi {P' x P') = TZ' 
and, moreover, £TZ C TZ, TZDVTZ = In other words, V is an extension ofV' on the elements of 
U which is consistent with V' , it contains the relations of u to P' given by £TZ and does not contain 
the relations of u to P' given by VTZ. The set STZ is then called the set of enforced relations and 
the set VTZ the set of prohibited relations. 

We give in Figure [2] the pseudocode of Step 4d' of EntropySort, which replaces Step 4d of 
Poset-BinInsertionSort. 

The correctness of the EntropySort algorithm follows trivially from the correctness of Poset- 
BinInsertionSort. We prove next that its query complexity is optimal. Recall that Nyj{n) 
denotes the number of partial orders of width w on n elements. 

Theorem 6. EntropySort sorts any partial order T' of width at most w on n elements using at 
most 21og A'^(n) + 4wn = B(nlogn + wn) oracle queries. In particular, the query complexity of 
the algorithm is at most 2n log n + 8wn + 2w log w. 

Proof. We first characterize the number of oracle calls required by the weighted binary searches. 

Lemma 7 (Weighted Binary Search). For every j € {1, 2, . . . , £j + 1}, if eij is the smallest element 
of chain Ci which dominates element e (j = ^i + 1 corresponds to the case where no element of 
chain Ci dominates e), then j is found after at most 2 • (1 + log w^) oracle queries in Step v. of 
the algorithm described above. 
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Step 4d' for Algorithm EntropySort(P) 

4d'. en = %;vn = 0; 

for i = 1, . . . , g 

i. let Ci = {en, Cj^J, where eu^ >-...>- ei-2 >- en; 

ii. for j = 1, . . . ,£i + 1 

• set ETZj = {{eik,e)\j <k< ii}; set VTZj = {{eik,e)\l <k< j}; 

• compute "Dij, the number of w-extensions of V on U, 
conditioned on (STZ U £nj,Vn U VTlj); 

/* T>ij represents the number of posets on P consistent with V\ (STZ, VTZ), 
in which e^j is the smallest element of chain Ci that dominates e; 
j = ii + 1 corresponds to the case that no element of Ci dominates e;*/ 
endfor 

iii. set Vi = Ei=i^Ai; 

/* Vi is equal to the total number of i(;-extensions of V' on U 
conditioned on {ElZ^VTZ)* / 

iv. partition the unit interval [0, 1) into ii + 1 intervals {[bj,tj)Yj''J^^ , 
where 6i = 0, bj = tj-i, for all j > 2, 

and tj = {Yjj'<j for ah j > 1. 

/* each interval corresponds to an element of Ci or a "dummy" element Cj^.+i */ 
V. do binary search on [0, 1) to find smallest element (if any) of Ci that dominates e: 
/* weighted version of binary search in Step 4dii of Poset-BinInsertionSort */ 
set X = 1/2; t = 1/4; f = 0; 
repeat: find j such that x S [6j,tj); 

if (j = ii + 1 and ejj_i e) OR (cy >- e and j = 0) OR (e^ >- e and ejj_i )f e) 

set j* = j; break; /* found smallest element in Ci that dominates e */ 
else if {j = ii + l) OR {eij >- e) 

set X = X — t; t = t * 1/2; /* look below */ 
else 

set X = X + t; t = t * 1/2; /* look above */ 

vi. eij* is the smallest element of chain Ci that dominates e; 
set £'7^ := £'7^ U STZj* and Vn:=Vn[J VTlj* ; 

vii. find the largest element (if any) of chain Ci that is dominated by e: 
for j = 0, 1, . . .,ii, 

compute V'-, the number of posets on P consistent with V' , {£TZ, VTZ), 

in which eij is the largest element of chain Ci dominated by e; 
/* j = corresponds to case that no element of Cj is dominated by e; */ 

do the weighted binary search analogous to that of Step v; 

viii. update accordingly the sets 8TZ and VTZ; 
endfor 

Figure 2: Algorithm EntropySort is obtained by substituting Step 4d' above for Step 4d 
of the pseudo-code in Figured] for Poset-BinInsertionSort. 



7 



Proof. Let A = be the length of the interval that corresponds to Cij. We wish to prove that the 
number of queries needed to find Cij is at most 2(1 + [log ^ J)- From the definition of the weighted 
binary search, we see that if the interval corresponding to eij contains a point of the form ■ m 
in its interior, where r, m are integers, then the search reaches Sij after at most r steps. Now, an 
interval of length A must include a point of the form 2""^ ■ m, where r = 1 + [ log j J , which concludes 
the proof. □ 

It is important to note that the number of queries spent by the weighted binary search is small 
for uninformative insertions, which correspond to large I^ij's, and large for informative ones, which 
correspond to small Pij's. Hence, our use of the term entropy- weighted binary search. A parallel 
of Lemma [7] holds, of course, for finding the largest element of chain Ci dominated by element e. 

Suppose now that P = {ei, . . . , e„}, where ei, 62, . . . , is the order in which the elements 
of P are inserted into poset V . Also, denote by Vk the restriction of poset V onto the set of 
elements {ei, 62, Cfc} and by the number of width w extensions of poset Vk on P\{ei, . . . , Cfc} 
conditioned on (0,0). Clearly, Zq = A^^(n) and Z„ = 1. The following lemma is sufficient to 
estabhsh the optimality of EntropySort. 

Lemma 8. EntropySort needs at most 4?/; + 2 log y ^ oracle queries to insert element Ck+i into 
poset Vk in order to obtain Vk+i- 



Proof. Let C = {Ci, . . . , Cq\ be the chain decomposition of the poset Vk constructed at Step 4c of 
EntropySort in the iteration of the algorithm in which element ek+i needs to be inserted into 
poset Vk- Suppose also that, for alH S {1, . . . , g}, vTj G {1, . . . , £j + 1} and Kj G {0, 1, . . . are 
the indices computed by the binary searches of Steps v. and vii. of the algorithm. Also, let Pj, 
Vij, j e {1, . . . , ii + 1}, and 2?^, V^j, j e {0, . . . be the quantities computed at Steps ii., iii. 
and vii. It is not hard to see that the following are satisfied 

Vi^^ =Vi,\/i=l,...,q V[,^ = A+i, Vi = !,...,(?-! 

Now, using Lemma [71 it follows that the total number of queries required to construct Vk+i from 
Vk is at most 

V (2 + 21og-^ + 2 + 21og-^ 1 <47i; + 21og/' 



i=l 



k+1 



□ 



Using Lemma [HI the query complexity of EntropySort is 

n— 1 n— 1 



/ z \ 

(# queries needed to insert element efc_|_i) = | Aw + 2 log ^ \ 



k=0 k=0 



= Awn + 2 log — ^ = Awn + 2 log Nu,{n). 

Zn 

Taking the logarithm of the upper bound in Theorem [H it follows that the number of queries 
required by the algorithm is 2n log n + 8wn + 2w log w. 

□ 
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Algorithm Poset-Mergesort(P) 

input: a set P, a query oracle for a poset V = {P, y), and upper bound w on width of V 
output: a ChainMerge data structure for V 

run Poset-Mergesor.t-Recurse(P) producing a decomposition C of V into w chains; 
build and return ChainMerge(P, C); 

Procedure Poset-Mergesort-Recurse(P') 

input: a subset P' C P, a query oracle for V = (P, y), an upper bound w on the width of V 
output: a decomposition into at most w chains of the poset V' induced by >- on P' 

if \P'\ < w 

then return the trivial decomposition of V' into chains of length 1 
else 

1. partition P' into two parts of equal size, P{ and P2, 

2. run Poset-Mergesort-Recurse(P{) and Poset-Mergesort-Recurse(P2); 

3. collect the outputs to get a decomposition C of V' into q < 2w chains; 

4. if q > w, run Peeling('P, C), to get a decomposition C of V' into w chains; 
return C'; 

Figure 3: pseudo-code for Poset-Mergesort 



3.2 An efficient sorting algorithm 

In this section, we turn to the problem of efficient sorting. Superficially, the Poset-Mergesort 
algorithm that we present has a recursive structure that is similar to the classical Mergesort algo- 
rithm. The merge step is quite different, however; it makes crucial use of the technical Peeling 
algorithm in order to efficiently maintain a small chain decomposition of the poset throughout the 
recursion. The Peeling algorithm, described formally in Section 13.2.21 is a specialization of the 
classic flow-based bipartite-matching algorithm [6] that is efficient in the comparison model. 

3.2.1 Algorithm Poset-Mergesort 

Given a set P, a query oracle for a poset V = {P, y), and an upper bound of w on the width of V, 
the Poset-Mergesort algorithm produces a decomposition of V into w chains and concludes by 
building a ChainMerge data structure. To get the chain decomposition, the algorithm partitions 
the elements of P arbitrarily into two subsets of (as close as possible to) equal size; it then finds a 
chain decomposition of each subset recursively. The recursive call returns a decomposition of each 
subset into at most w chains, which constitutes a decomposition of the whole set P into at most 
2t(; chains. Then the Peeling algorithm of Section [3.2.21 is applied to reduce the decomposition to 
a decomposition of w chains. Given a decomposition of P' C P, where m = \P'\, into at most 2w 
chains, the Peeling algorithm returns a decomposition of P' into w chains using 2wm queries and 
0{w'^m) time. Figure [3] shows pseudo-code for Poset-Mergesort. 

Theorem 9. Poset-Mergesort sorts any poset V of width at most w on n elements using at 
most 2wnlog{n/w)) queries, with total complexity 0{w'^nlog{n/w)). 
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Proof. The correctness of Poset-Mergesort is immediate. Let T{m) and Q{m) be the worst-case 
total and query complexity, respectively, of the procedure Poset-Mergesort-Recurse on a poset 
of width w containing m elements. When m < w, T{m) = 0{w) and Q{m) = 0. When m > w, 
T{m) = 2T(m/2) + 0{uP"m) and Q{m) < 2Q{m/2) + 2wm. Therefore, T[n) = 0{'uP"n\og{n/w)) 
and Q{n) < 2wnlog(n/w). The cost incurred by the last step of the algorithm, i.e. that of building 
the ChainMerge, is negligible. □ 

3.2.2 The Peeling algorithm 

In this section we present an algorithm that efficiently reduces the size of a given decomposition 
of a poset. It can be seen as an adaptation of the classic flow-based bipartite-matching algorithm 
[6] that is designed to be efficient in the oracle model. The Peeling algorithm is given an oracle 
for poset V = (P,y), where n = \P\, and a decomposition of P into at most 211) chains. It first 
builds a ChainMerge data structure using at most 2qn queries and time 0{qn). Every query the 
algorithm makes after that is actually a look-up in the data structure and therefore takes constant 
time and no oracle call. 

The Peeling algorithm proceeds in a number of peeling iterations. Each iteration produces a 
decomposition of V with one less chain, until after at most w peeling iterations, a decomposition 
of V into w chains is obtained. A detailed formal description of the algorithm is given in Figure [H 

Theorem 10. Given an oracle for V = {P,y), where n = \P\, and a decomposition of V into at 
most 2w chains, the Peeling algorithm returns a decomposition ofV into w chains. It has query 
complexity at most 2wn and total complexity 0{w'^n). 

Proof. To prove the correctness of one peeling iteration, we first observe that it is always possible 
to find a pair (x, y) of top elements such that y >~ x, as specified in Step la, since the size of any 
anti-chain is at most the width of V, which is less than the number of chains in the decomposition. 
We now argue that it is possible to find a subsequence of dislodgements as specified by Step 2a. 
Let yt be the element defined in step 3 of the algorithm. Since yt was dislodged by xt, xt was the 
top element of some list when that happened. In order for xt to be a top element, it was either top 
from the beginning, or its parent yt-i must have been dislodged by some element xt-i, and so on. 

We claim that, given a decomposition into q chains, one peeling iteration produces a decompo- 
sition of V into q — I chains. Recall that yi >- xi and, moreover, for every i, 2 < i < t, yi y Xi, 
and yi~i >- Xi. Observe that after Step 4 of the peeling iteration, the total number of pointers has 
increased by 1. Therefore, if the link structure remains a union of disconnected chains, the number 
of chains must have decreased by 1, since 1 extra pointer implies 1 less chain. It can be seen that 
the switches performed by Step 4 of the algorithm maintain the invariant that the in-degree and 
out-degree of every vertex is bounded by 1. Moreover, no cycles are introduced since every pointer 
that is added corresponds to a valid relation. Therefore, the link structure is indeed a union of 
disconnected chains. 

The query complexity of the Peeling algorithm is exactly the query complexity of Chain- 
Merge, which is 2wn. We show next that one peeling iteration can be implemented in time 
0{qn), which implies the claim. 

In order to implement one peeling iteration in time 0{qn), a little book-keeping is needed, in 
particular, for Step 2a. We maintain during the peeling iteration a list L of potentially-comparable 
pairs of elements. At any time, if a pair (x, y) is in L, then x and y are top elements. At the 
beginning of the iteration, L consists of all pairs (x, y) where x and y are top elements. Any 
time an element x that was not a top element becomes a top element, we add to L the set of all 
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Algorithm Peeling('P, C) 

input: a query oracle for poset V = {P, y), an upper bound of w on the width of V, 

and a decomposition C = {Ci, . . . , Cq} of V, where q < 2w 
output: a decomposition of V into w chains 

build ChainMerge(P,C); /* All further queries are look-ups. */ 
for i = 1, . . . ,q 

construct a linked list for each chain Ci = en. 6^2 Cji, 

where eu^ ^ ■ ■ ■ y ei2 y en; 
while q > w, perform a peeling iteration: 

1. for i = 1, . . . , g, set C'^ = Ci\ 

2. while every C[ is nonempty 

/* the largest element of each is a top element * / 

a. find a pair {x,y), x G C'^, y G Cj, of top elements such that y >- x; 

b. delete y from Cj; /* x dislodges y * / 

3. in sequence of dislodgements, find subsequence (xi, yi), . . . , (xt,yt) such that: 

• yt is the element whose deletion (in step 2b) created an empty chain; 

• for i = 2, . . . ,t, yi-i is the parent of Xi in its original chain; 

• xi is the top element of one of the original chains; 

4. modify the original chains Ci, . . . , Cq: 

a. for i = 2, . . . ,t 

i. delete the pointer going from to xf, 

ii. replace it with a pointer going from yi to Xi] 

b. add a pointer going from yi to Xi; 

5. set q = q — 1, and re-index the modified original chains from 1 to (7 — 1; 
return the current chain decomposition, containing w chains 



Figure 4: pseudo-code for the Peeling Algorithm 
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pairs (x, y) such that y is currently a top element. Whenever a top element x is dislodged, we 
remove from L all pairs that contain x. When Step 2a requires us to find a pair of comparable top 
elements, we take an arbitrary pair (x, y) out of L and check if x and y are comparable. If they are 
not comparable, we remove (x, y) from L, and try the next pair. Thus, we never compare a pair of 
top elements more than once. Since each element of P is responsible for inserting at most q pairs 
to L (when it becomes a top element), it follows that a peeling iteration can be implemented in 
time 0{qn). 

□ 

4 The /c-selection problem 

The A;-selection problem is the natural problem of finding the elements in the bottom k layers, i.e., 
the elements of height at most k — 1, of a poset V = (-P, y), given the set P of n elements, an upper 
bound w on the width of V, and a query oracle for V. We present upper and lower bounds on the 
query and total complexity of A:-selection, both for deterministic and randomized computational 
models, for the special case of A: = 1 as well as the general version. While our upper bounds arise 
from natural generalizations of analogous algorithms for total orders, the lower bounds are achieved 
quite differently. We conjecture that our deterministic lower bound for the case of A; = 1 is actually 
tight, though the upper bound is off by a factor of 2. 

4.1 Upper bounds 

In this section we analyze some deterministic and randomized algorithms for the A;-selection prob- 
lem. We begin with the 1-selection problem, i.e., the problem of finding the minimal elements. 

Theorem 11. The minimal elements can be found deterministically with at most wn queries and 
0{wn) total complexity. 

Proof. The algorithm updates a set of size w of elements that are candidates for being smallest 
elements. Initialize Tq = 0. Assume that the elements At step t: 

• Compare xt to all elements in Tt-i- 

• If there exists some a G Tj_i such that xt >- a, do nothing. 

• Otherwise, remove from Tt^i all elements a such that a>- xt and put xt into Tt. 

At the termination of the algorithm, the set T„ contains all height elements. By construction 
of Tt, for all t, the elements in Tt are mutually incomparable. Therefore, for all t, it holds that 
\Tt\ <:W, and hence the query complexity of the algorithm is at most wn. □ 

Theorem 12. There exists a randomized algorithm that finds the minimal elements in an expected 
number of queries that is upper bounded by 

w + 1 w"^ — w 

^ n H ^ (logn - logu;). 

Proof. The algorithm is similar to the algorithm for the proof of Theorem [TTl with modifications 
to avoid (in expectation) worst-case behavior. Let o" be a permutation of [n] chosen uniformly at 
random. Let Ti = {a^o-(i)}- For 1 < t < n, at step t: 
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• Let i be an index of the candidates in T(__i, i.e. Tt-i = . . . , where r < w. 

• Let Tt = Tt^i. Let r be a permutation of [r] chosen uniformly at random. 

• For j = 1, . . . ,r: 

— If Xo-(i-) y exit the loop and move to step t + 1. 

- If Xj(^(j)) >- remove from Tt. 

• Add x^(^t) to Tt. 

As in the previous algorithm, it is easy to see that at each step t, the set Tt contains all the minimal 
elements of At = {x^(^i^, . . . and that \Tt\ < w. Note furthermore that at step t, 

w 

P[Xo-(j) is minimal for ^4^] < —. 

If x^(^t) is not minimal for At, then the expected number of comparisons needed until x^(^t) is 
compared to an element a At that dominates x^^t) is clearly at most {w + l)/2. We thus conclude 
that the expected running time of the algorithm is bounded by: 

. . / W (t — w)(w + l)\ / ■w\ 1/9 \ 

(t-i)+ E (7-+ ' t 2 ) = (2J+ 2: 

t=2 t=w+l ^ / \ / t=w+l 

W + 1 w"^ — W 
< — - — nH (log n - log It;) 

□ 

We now turn to the fc-selection problem for A; > 1. We first provide deterministic upper bounds 
on query and total complexity. 

Theorem 13. The query complexity of the k-selection problem is at most 

16wn + 4n log {2k) + 6n logw. 
Moreover, there exists an efficient k-selection algorithm with query complexity at most 

Swnlog {2k) 

and total complexity 

0{w^nlog{2k)). 

Proof. The basic idea is to use the sorting algorithm presented in previous sections in order to 
update a set of candidates for the /c-selection problem. Denote the elements by xi,. . . ,Xn. Let 
Co = 0. The algorithm proceeds as follows, beginning with t=l: 

• While {t - l)wk + I < n, let A = Ct-i U {xi^t-i)wk+i, • • • , Xmm{twk,n)}- 

• Sort Dt. Let Ct be the solution of the /c-selection problem for Dt. 

Clearly, at the end of the execution, the last Ct will contain the solution to the fc-selection prob- 
lem. As we have shown, the query complexity of sorting Dt is Awklog {2wk) + 16w^k + 2wlogw 
and, therefore, the query complexity of the algorithm is ^^{Awklog {2wk) + IGw'^k + 2wlogw) = 
4n\og{2wk) + 16wn + ^logw. This proves the first result. Using the computationally efficient 
sorting algorithm, we have sorting query complexity 8w'^klog{2k) which results in total query 
complexity Snu) log (2fe) and total complexity 0{nw'^ log{2k)). □ 
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Next we outline a randomized algorithm with a better coefficient of the main term wn. 



Theorem 14. The k-selection problem has a randomized query complexity of at most 



wn + IGhw"^ lognlog(2/c) 



and total complexity 



0{wn + poly{k, w) log n). 



Proof. We use the following algorithm: 

• Choose an ordering xi, . . . , a;„ of the elements uniformly at random. 

• Let Cyjk = {a^i, • • • , Xwk} and D^jk = 0- 

• Sort Cwk ■ Remove any elements from Cy^k that are of height greater than k — 1. 

• Let t = wk + 1. While t <n do: 

— Let Ct = Ct-i and A = A-i- 

— Compare Xt to the maximal elements in Ct in a random order. 

* For each maximal element a Ct- if height (a) = k — 1 and a xt, or if height (a) < 
k — 1 and xt y a, then add xt to Dt, and exit this loop. 

* If for all elements a (z Ct, xt a, then add xt to Dt and exit this loop; 

— If \Dt\ = wk or t = n: 



• Output the elements of C„. 

It is clear that Cn contains the solution to the /c-selection problem. To analyze the query complexity 
of the algorithm, recall from Theorem [9] that s{w,k) = 8w'^klog{2k) is an upper bound on the 
number of queries used by the efficient sorting algorithm to sort 2wk elements in a width-tt) poset. 

There are two types of contributions to the number of queries made by the algorithm: (1) 
comparing elements to the maximal elements of Ct, and (2) sorting the sets Cq and CtU Dt- 

To bound the expected number of queries of the first type, we note that for t > kw + 1, since 
the elements are in a random order, the probability that xt ends up in Dt is at most min 
If Xt is not going to be in Dt, then the number of queries needed to verify this is bounded by w. 
Overall, the expected number of queries needed for comparisons to maximal elements is bounded 
by wn. 

To calculate the expected number of queries of the second type, we bound the expected number 
of elements that need to be sorted as follows: 



* Sort CtUDt. 

* Set Ct to be the elements of height at most k — 1 in CtU Dt. 

* Set Dt = 0. 




We thus obtain that the total query complexity is bounded above by wn + 2s{w, k) logn. 



□ 
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4.2 Lower bounds 



We obtain lower bounds for the /c-selection problem both for adaptive and non-adaptive adversaries. 
Some of our proofs use the following lower bound on finding the fc-th smallest element of a total 
order on n elements: 

Theorem 15 (Fussenegger-Gabow [5]). The number of queries required to find the kth smallest 
element of an n-element total order is at least n — k + log (;,.";^)- 

The proof of Theorem [15] shows that every comparison tree that identifies the fcth smallest element 
must have at least 2"'~'^(^"-^) leaves, which implies that the theorem also holds for randomized 
algorithms. 

4.2.1 Adversarial lower bounds 

We consider adversarial lower bounds for the /c-selection problem. In this model, an adversary 
simulates the oracle and is allowed to choose her response to a query after she receives it. Any 
response is legal as long as there is some partial order of width w with which all of her responses 
are consistent. We begin with the case of /c = 1, i.e. finding the set of minimal elements. 

Theorem 16. In the adversarial model, at least ^^^-^n — w comparisons are needed in order to find 
the minimal elements. 

Proof. Consider the following adversarial algorithm. The algorithm outputs query responses that 
correspond to a poset Vofw disjoint chains. Given a query q{a, b), the algorithm outputs a response 
to the query, and in some cases, it may also announce for one or both of a and b to which chain 
the element belongs. Note that receiving this extra information can only make things easier for the 
query algorithm. During the course of the algorithm, the adversary maintains a graph G = (P, E). 
Whenever the adversary responds that a 9*^ 6, it adds an edge (a, b) to E. 

Let qtia) be the number of queries that involve element a, out of the first t queries overall. Let 
c(a) be the chain assignment that the adversary has announced for element a. (We set c(a) to be 
undefined for all a, initially.) Let {xj}"^^ be an indexing, chosen by the adversary, of the elements 
of P. Let q{a, b) be the t'th query. The adversary follows the following protocol: 

• If qt{a) < w — 1 or qt{b) < w — 1, return a ^b. In addition: 

— \i qt(a) = — 1, choose a chain c(a) for a that is different from all the chains to which 
a's neighbors in G belong, and output it. 

— If = w — \ choose a chain c(6) for b that is different from all the chains to which 
6's neighbors in G belong, and output it. 

• If qt(a) > — 1, > w — 1, and c(a) 7^ c(6), then output a ^b. 

• Otherwise, let i and j be the indices of a and 6, respectively (i.e. a = Xi and b = Xj). If i > j, 
then output a y b; otherwise, output b y a. 

It is easy to see that the output of the algorithm is consistent with a width-tM poset consisting of 
w chains that are pairwise incomparable. We will also require that each of the chains is chosen at 
least once (this is easily achieved). 

We now prove a lower bound on the number of queries to this algorithm required to find a proof 
that the minimal elements are indeed the minimal elements. 
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In any proof that a is not a smallest element, it must be shown to dominate at least one other 
element, but to get such a response from the adversary, a must be queried against at least w — \ 
other elements with which it is incomparable. To prove that a minimal element of one chain is 
indeed minimal, it must be queried at least against the minimal elements of the other chains to 
rule out the possibility it dominates one of them. Therefore, each element must be compared to 
at least w — \ elements that are incomparable to it. So the total number of queries of type q{a, b), 
where a 9^ 6, is at least ^^^^n. 

In addition, for each chain Cj of length rij, the output must provide a proof of minimality for 
the minimal element of that chain. By Theorem 1151 this contributes rij — 1 comparisons for each 
chain a. 

Summing over all the bounds proves the claim. □ 

Theorem 17. Let r = 2w~i ■ k < r then the number of queries required to solve the k-selection 
problem is at least 



(w + l)n , , s w'^ ■ i , 1 \ 1 / ■ 1 

w[k + log K) h mm [w — I) log + log 



rw 



2 ' °' 8 V ' \k-lj "\k-l 

nir — k)(w — 1) , f n — iw — l)k 
+ log' 



2r "\k-l 

Proof. The adversarial algorithm outputs query responses exactly as in the proof of Theorem [TBI 
except in the case where the tth query is (a, b) and qt{o) = w — 1 or qt{b) = w — 1. In that case it 
uses a more specific rule for the assignment of one or both of these elements to chains. 

In addition to assigning the elements to chains, the process must also select the k smallest 
elements in each chain, and the Fussenegger-Gabow theorem (Theorem I ISp gives a lower bound, in 
terms of the lengths of the chains, on the number of comparisons required to do so. 

We think of the assignment of elements to chains as a coloring of the elements with w colors. 
The specific color assignment rule is designed to ensure that, if the number of elements with color 
c is small, then there must have been many queries in which the element being colored could not 
receive color c because it had already been declared incomparable to an element with color c. It 
will then follow that there have been a large number of queries in which an element was declared 
incomparable to an element with color c. Thus, if many of the chains are very short, then the 
number of pairs declared incomparable must be very large. On the other hand, if few of the chains 
are very short, then we can employ the Fussenegger-Gabow Theorem to show that the number of 
comparisons required to select the k smallest elements in each chain must be large. We obtain the 
overall lower bound by playing off these two observations against each other. 

The color assignment rule is based on a function dt(c), referred to as the deviation of color c 
after query t, and satisfying the initial condition (io(c) = for all c. The rule is: "assign the eligible 
color with smallest deviation." 

More specifically, let the tth query be {at,bt). The adversary processes at and then bt- Recall 
that qtia) is the number of queries involving element a out of the first t queries overall. Element 
e € {at, bt} is processed exactly as in the proof of Theorem [T6] except when qt{e) = vu — 1. In that 
case, let St{e) be the set of colors that are not currently assigned to neighbors of e; i.e., the set of 
colors eligible to be assigned to element e. Let c* = argmin^g^j^^g) dj„i(c). The adversary assigns 
color c* to e. Then the deviations of all colors are updated as follows: 

1. if c St{e) then dt{c) = dt-i{c)] 

2. dt{c*)^dt-i{c*) + l-j^^; 
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3. For c G St{e) \ {c*}, dt{c) ^ dt.i{c) - 

The function dt{c) has the following interpretation: over the history of the color assignment 
process, certain steps occur where the adversary has the choice of whether to assign color c to 
some element; dt{c) represents the number of times that color c was chosen up to step t, minus the 
expected number of times it would have been chosen if the same choices had been available at all 
steps and the color had been chosen uniformly at random from the set of eligible colors. 

Because the smallest of the deviations of eligible colors is augmented at each step, it is not 
possible for any deviation to drift far from zero. Specifically, it can be shown by induction on t 
that at every step t the sum of the deviations is zero and for m = 1,2, ... ,w, the sum of the m 
smallest deviations is greater than or equal to '"("^~"') ^ 

Let degQ(o) be the degree of a in G at the end of the process. At the end of the process every 
element of degree greater than or equal to ^« — 1 in G has been assigned to a chain. Each element 
of degree less than w — 1 has not been assigned to a chain, and is therefore called unassigned. An 
unassigned element is called eligible for chain c if it has not been compared (and found incomparable) 
with any element of chain c. Let s(c) be the length of chain c and define def(c), the deficiency of 
chain c, as max(0. A; — s(c)). Define the total deficiency def as the sum of the deficiencies of all 
chains. 

Let u be the number of unassigned elements. Upon the termination of the process it must 
be possible to infer from the results of the queries that every unassigned element is of height at 
most k — 1. This implies that, if unassigned element x is eligible for chain c, then the number of 
unassigned elements eligible for chain c must be at most def(c). Thus the number of pairs (a, c) 
such that unassigned element a is eligible for chain c is DEF. Define the deficiency of unassigned 
element a as it; — 1 — deg(^(a). Then the sum of the deficiencies of the unassigned elements is 
bounded above by def, and therefore the sum of the degrees in G of the unassigned elements is at 
least {w — l)u — DEF. 

By Theorem [HI if s(c) > k, then at least ^s(c) — A; + log (^^^)^ comparisons are needed to 
determine the k smallest elements of chain c. 

The total number of comparisons is the number of edges that have been placed in G in the 
course of the algorithm (i.e., the number of pairs that have been declared incomparable by the 
adversary), plus the number of comparisons required to perform /c-selection in each chain. The 
total number of pairs that have been declared incomparable is ^ deg(^(a). 

Let d{c) be the deviation of color c at the end of the process. Let r(c) be the number of 
steps in the course of the process at which the element being colored was eligible to receive color 
c. If, at each such step, the color had been chosen uniformly from the set of eligible colors, 
then the chance of choosing color c would have been at least ^. Thus, by the interpretation 

of the function dt{c) given above, s{c) > + d{c); equivalently, r(c) < w{s{c) — d{c)). Also, 
Yla\c{a)=c ^^Sg{^) ^ n — r(c) > n — w{s{c) — d{c)). This sum is also at least {w — l)s{c), since every 
element assigned to c has been declared incomparable with at least {w — 1) other elements. 

We can now combine these observations to obtain our lower bound. For each chain c define 
cost(c) = i Ea|c(a)=c degG(a)+max (o, s{c) - + log (,!L1)) • Then Zc cost(c)+i {{w - l)u - def) 
is a lower bound on the total number of comparisons, and 

cost(c) > - max {{w — l)s{c),n — w{s{c) — d{c))) 

c c 

+ E 

c\s{c)>k 



17 



To obtain our lower bound we shall minimize this function over all choices of nonnegative 
integers s{c), u and def such that ^c^i*^) + u = n and def = ^^max(0, A; — s(c)). Noting that 

min((i(c), 0) > iaimmm{m — w)/2 = —w'^/8, we obtain the following lower bound on the total 
number of comparisons: 

(w — l)n DEF w'^ 1 , , N , NX / , N , , / sic) \ \ , , 
— + -^max(0,n-(2z.-l)s(c))+ K(c) - A. + log L j (1) 

c c\s{c)>k ^ V / / 

We now restrict attention to the case k < 2w-i • ^ ~ 2w-i • shall show that, at any 
global minimum of ([1]), DEF = 0. To see this, consider any choice of {s(c)} such that DEF > 0. Let 
c be a chain such that def(c) > 0. If s{c) is increased by 1, then def decreases by 1, and the net 
change in the value of quantity ^ is 1 — w, which is negative. 

Thus, in minimizing ([1]) we may assume that DEF = 0, and hence that s(c) = n. So ([1]) may 
be rewritten as 

(w — l)n \ - ^, , 



where 

F(s) 



( ^ max(0, n — {2w — l)s) if s < k 

y i max(0, n-{2w - l)s) + {s - k + \og (fclj) if s > k. 

Thus, we have the following minimization problem: 



Minimize ^^F(s(c)), subject to s{c) > and s{c) 



n. 



First, we note that '^c.\k<s{c)^^i^) = ~ wk. To determine the minimum we consider three 
ranges of values: the low range s = k, medium range k < s{c) < r, and high range r < s(c) < n. 
Observing that F{s) is strictly concave in the medium range, and concave and strictly increasing 
in the high range, it follows that, at the global minimum of ([1]), s{c) is equal to either A: or r except 
for one value in the high range and possibly one value strictly within the medium range. The value 
in the high range is at least rw, since the sum of the values in the low and medium ranges does 
not exceed r{w — 1). If X]c|fc<s{c)<r ^i'^) = {w — l)r — D, then the unique value of s(c) in the high 
range is rw + D. Moreover, exploiting the concavity of F{s) in the medium range, we claim that 
T^c\k<s{c)<r (fc^i) ^ {w — 1 — :^:^) log (^^-|^) . This bound is at most w log k greater than the sum 
Sc|fc<s(c)<r ik'-i) ■ Finally, a simple calculation shows that | J2c ™ax(0, n — {2w — l)s{c)) = 

Thus we get the following lower bound on X^c -^(^('^))- 

, . // , D \^ { r \ nD ^ (rw + D 
n - w[k + log k) + mm I w - I log , 1 + — h log 

0<D<{w~i){r~k) \\ r-kj \k-lj 2r \k-l 

Since this is a concave function it is minimized either aX D = Q or D = [w — l)(r — A;).This 
yields the following lower bound on the worst-case number of comparisons required to solve the 
/c-selection problem when k < r: 

- w{k + log fe) - ^ + min [{w - 1) log + log ( ™i) , + log ^^1'^')) . □ 

4.2.2 Lower bounds in the randomized query model 

We now prove lower bounds on the number of queries used by randomized /c-selection algorithms. 
We conjecture that the randomized algorithm for finding the minimal elements which we give in 
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the proof of Theorem 1121 essentially achieves the lower bound. However, the lower bound we prove 
here is a factor 2 different from this upper bound. 

We consider a distribution D{n,w) on partial orders of width w over a set P = {xi, . . . ,Xn}- 
The distribution D{n,w) is defined as follows: 

• The support of D{n,w) is the set of partial orders consisting of w chains, where any two 
elements from different chains are incomparable. 

• Each element belongs independently to one of the w chains with equal probability. 

• The linear order on each chain is chosen uniformly. 

Theorem 18. The expected query complexity of any algorithm solving the k-selection problem is 
at least 

w + 3 ( f ^ \\ A fn/{2w) 

Proof. In order to provide a lower bound on the number of queries, we provide a lower bound on 
the number of queries of incomparable elements and then use the classical bound to bound the 
number of queries of comparable elements. 

First we note that for each element a, the algorithm must make either at least one query where 
a is comparable to some other element b, or at least w — 1 queries where a is incomparable to all 
elements queried. (The latter may suffice in cases where a is the unique element of a chain and it 
is compared to all minimal elements of all other chains.) 

We let Yt{i) denote the number of queries involving Xj before the first query for which the 
response is that Xi is comparable to an element. Also for each of the chains Ci, . . . , we denote 
by Za the number of comparisons involving two elements from the same chain. 

Letting T denote the total number of queries before the algorithm terminates, we obtain: 



n ^ w 



2 

=1 a=l 



We claim that for all 1 < i < n we have E[Yj'(z)] > This follows by conditioning on the 

chains that all other elements but Xi belong to. With probability l/w, the first query will give a 
comparison; with probability l/w, the second query, etc. 

On the other hand, by the classical lower bounds we have for each 1 < a < w that 



Za > \Ca\ - A; + log 

Taking expected value we obtain 

n. 

log 



HZa] > k + B 

w 



\Ca\ 
k-l 



k-l 



A rough bound on the previous expression may be obtained by using the fact that by standard 
Chernoff bounds, except with probability exp(— ^), it holds that Ca is of size at least n/{2w). 
Therefore 



E 
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Summing all of the expressions above, we obtain 

(w-l)n /n \ / ^ n w ^ fn/(2w)\ 

—T^ + - (- - ^) + (i - -P (-^)) ( i. 1 J 

and simplifying gives the desired result. □ 



5 Computing linear extensions and heights 

In this section we consider two problems that are closely related to the problem of determining a 
partial order: given a poset, compute a linear extension, and compute the heights of all elements. 

A total order (P, >) is a linear extension of a partial order (P, y) if, for any two elements x 
and y, X y y implies x > y. We give a randomized algorithm that, given a set P of n elements and 
access to an oracle for a poset {P, >-) of width at most w, computes a linear extension of (P, y) 
with expected total complexity 0(n(logn + w)). We give another randomized algorithm that, on 
the same input, determines the height of every element of (P, y) with expected total complexity 
0{nw log n). 

The algorithms are analogous to Quicksort, and are based on a ternary search tree, an extension 
of the well-known binary search tree for maintaining elements of a linear order. A ternary search 
tree for (P, ^), consists of a root, a left subtree, a middle subtree and a right subtree. The root 
contains an element x G P and the left, middle, and right subtrees are ternary search trees for 
the restrictions of (P, >-) to the sets {y\x >- y}, {y\x ^ y} and {y\y >- x}, respectively. The 
ternary search tree for the empty poset consists of a single empty node. A randomized algorithm 
to construct a ternary search tree for (P, y) assigns a random element of P to the root, compares 
each of the n — 1 other elements to the element at the root to determine the sets associated with 
the three children of the root, and then, recursively, constructs a ternary search tree for each of 
these three sets. 

Define the weight of an internal node x of a ternary search tree as the total number of internal 
nodes in its three subtrees, and the weight of a ternary search tree as the sum of the weights of all 
internal nodes. Then the number of queries required to construct a ternary search tree is exactly 
the weight of the tree. 

Theorem 19. The expected weight of a ternary search tree for any poset of size n and width w is 
0{n{log n + w)). 

Proof Sketch. Consider the path from the root to a given element x. The number of edges in this 
path from a parent to a middle subtree is at most w, and the expected number of edges from a 
parent to a left or right subtree is O(logn) since, at every step along the path, the probability is at 
least 1/2 that the sizes of the left and right subtrees differ by at most a factor of 3. It follows that 
the expected contribution of any element to the weight of the ternary search tree is w+0(\og n). □ 

Once a ternary search tree for a poset has been constructed, a linear extension can be constructed 
by a single depth-first traversal of the tree. If x is the element at the root, then the linear extension 
is the concatenation of the linear extensions of the following four subsets, corresponding to the node 
and its three subtrees: {y\x y y}, {x}, {y\x ^ y} and {y\y y x}. The corollary below follows. 

Corollary 20. There is a randomized algorithm of expected total complexity 0(n(logn -|- w)) for 
computing a linear extension of a poset. 
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Let h{x) = hhe the height of element x in (P, y). Given a hnear extension Xn > ■ ■ ■ > X2 > xi, 
it is easy to compute h{x) for each element x by binary search, using the following observation: Let 
S{i, h) = {xj \j < i, h{xj) = h} be the set of elements of index at most i in the linear extension and 
of height h in (P,y). Then \S{i,h)\ < w (as the elements of S{i,h) are pairwise incomparable), 
and > /i if and only if there exists x £ S{i,h) such that Xi^i >- x. Thus, given the sets 

S{i,h), for all h, we can determine h{xi^i) and the sets S{i + l,h), for all h, in time 0{w\ogi) 
using binary search. This yields: 

Corollary 21. Given a linear extension, there is a deterministic algorithm with total complexity 
0{wn log n) to compute the heights of all elements of a partial order of size n and width w. Combin- 
ing this algorithm with the above algorithm for computing a linear extension, there is a randomized 
algorithm to determine the heights of all elements with expected total complexity 0{wn\ogn). 

6 Variants of the poset model 

In this section, we discuss sorting in two variants of the poset model that occur when different 
restrictions are relaxed. First, we consider posets for which a bound on the width is not known 
in advance. Second, we allow the irreflexivity condition to be relaxed, which leads to transitive 
relations. We show that with relatively little overhead in complexity, sorting in either case reduces 
to the problem of sorting posets. 

6.1 Unknown width 

Recall from Section [3] that Nu]{n) is the number of posets of width at most w on n elements. 

Claim 22. Given a set P of n elements and access to an oracle for poset V = {P, y) of un- 
known width w, there is an algorithm that sorts P using at most 2 log vu (log N2w{'n) -\- Awn) = 
Q{nlogw {logn + w)) queries, and there is an efficient algorithm that sorts P using at most 
8nwlogwlog{n/{2w)) queries with total complexity O {nw'^ log wlog{n/w)). 

Proof. We use an alternate version of EntropySort that returns fail if it cannot insert an 
element (while maintaining a decomposition of the given width) and an alternate version of Poset- 
Mergesort that returns fail if the Peeling algorithm cannot reduce the size of the decomposition 
to the given width. The first algorithm of the claim is, for i = 1,2,..., to run the alternate version 
of algorithm EntropySort on input set P, the oracle, and width upper bound 2*, until the 
algorithm returns without failing. The second algorithm is analogous but uses the alternate version 
of Poset-Mergesort. The claim follows from Theorems [6] and O and from the fact that we reach 
an upper bound of at most 2w on the width of V in log w rounds. □ 

6.2 Transitive relations 

A partial order is a particular kind of transitive relation. In fact, our results generalize to the case 
of arbitrary transitive relations (which are not necessarily irreflexive) and are therefore relevant 
to a broader set of applications. Formally, a transitive relation is a pair (P, >), where P is a set 
of elements and > C P x P is transitive. The width of a transitive relation is defined to be the 
maximum size of a set of mutually incomparable elements. We say that a poset (P, is induced 
by a transitive relation (P, ^) if >. A poset (P, y) is minimally induced by (P, >) if for any 
relation (x,y) G >\ y, the pair (P, >- U{x,y)) is not a valid partial order, i.e. its corresponding 
graph contains a directed cycle. 

We require the following lemma, bounding the width of a minimally induced poset. 
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Lemma 23. Let {P,y) be a poset minimally induced by the transitive relation (P, C^). Then the 
width of (P, y) is equal to the width of {P, >). 

Proof. Suppose otherwise, that is, suppose that there is a pair of distinct elements x,y G P such 
that X ^ y with respect to the partial order (P, :^), but x and y have some relation in (P, ^). 
Without loss of generality, suppose that x^y; it may be simultaneously true that y^x. First, we 
note that (P, >~ U (x, y)) is a valid partial order; if it were not, i.e. if the addition of {x, y) introduced 
a cycle, then it would be the case that y y x, which is a contradiction to their incomparability. 
However, the poset (P, >- U (x, y)) is also induced by (P, >), which contradicts the assumption that 
(P, is minimally induced. □ 

We denote by Oy an oracle for a poset (P, y) and by Oc> an oracle for a transitive relation (P, >). 
In the following claim, we assume that the poset sorting algorithm outputs a chain decomposition 
(such as a ChainMerge); if it does not, the total complexity of the algorithm for sorting a transitive 
relation increases a bit, but not its query complexity. 

Claim 24. Suppose there is an algorithm A that, given a set P of n elements, access to an oracle 
Oy for a poset V = {P,y), and an upper bound of w on the width ofV, sorts P using f[n,w) 
queries and g{n, w) total complexity. Then there is an algorithm B that, given P, w, and access 
to an oracle 0> for a transitive relation (P, >) of width at most w, sorts P using f{n,w) + 2nw 
queries and g{n,w) + 0{nw) total complexity. 

Proof. Given an oracle Ci> for the transitive relation (P, ^), we define a special poset oracle O 
that runs as follows: Given a query q{x,y), the oracle O first checks if the relation between x 
and y can be inferred by transitivity and irreflexivity from previous responses. If so, it outputs 
the appropriate inferred response; otherwise, it forwards the query to the oracle Or>. The oracle 
O outputs the response of except if both x > y and y > x; in this case, O outputs whichever 
relation is consistent with the partial order determined by previous responses (if both relations 
are consistent, then it arbitrarily outputs one of the two). By definition, the responses of O are 
consistent with a partial order induced by (P, >). 

The first step of algorithm B is to run algorithm A on input P and w, giving A access to the 
special oracle O, which B simulates using its access to 0\>. Since A completely sorts its input, it 
reconstructs a poset induced by (P, >) via O that has a maximal set of relations. That is, there is 
a poset V = (P, y) minimally induced by (P, C>) such that the responses of O to the sequence of 
queries made by A are indistinguishable from the responses of to the same sequence of queries. 
Since V has the same width as (P, >), it is valid to give A the upper bound of w. Hence, A sorts 
V and outputs some chain decomposition C = {Ci, . . . Cq} of V such that q < w. 

The second step of algorithm B is to make a sequence of queries to the oracle to recover the 
relations in >\ y. It is similar to building a ChainMerge data structure: for all i,j, I < i, j < q, 
for every element x € Cj, we store the index of x in chain Ci and the index of the largest element 
y G Cj such that x>y. An analysis similar to the one for ChainMerge (see Section [2]) shows that 
it takes at most 2nq queries to the oracle Oi> and 0{nq) total complexity to find all the indices. 
The relation in (P, >) between any pair of elements can then be looked up in constant time. □ 
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